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IMPROVEMENTS IN OR RELATING TO ORGANIC COMPOUNDS 



Scope of the Invention 

The present invention relates to plants having reduced susceptibility to infection with tospoviruses. 
genetic material used to generate this tolerance, probes for the isolation or diagnosis and processes for 
obtaining such plants and genetic material and probes. 



Introduction 



In general, virus infections have a variety of detrimental effects on e.g. growth, morphology and yield of 
plants Also, virus infections often result in higher susceptibility of infected plants to other plant pathogens 
and plant pests. Transmission of plant viruses occurs generally by insect or fungal vectors or mechanically. 

Plant breeders are trying continuously to develop varieties of crop plant species tolerant to or resistant 
to specific virus strains. Traditionally virus resistance genes are transferred from wild relatives into the 
commercial varieties by breeding. The transfer of an existing resistance from the gene pool of wild relatives 
to a cultivar is a tedious process in which the resistance gene(s) first have to be identified in a source 
(donor) plant species and the resistance genes have to be combined with the perfect gene pool of a 
commercial variety. Resistances or tolerances generated in this way are in many cases only active against 
one or a few strains of the virus In question. Also, breeding cultivars for resistance to a particular virus 
species is often limited by a lack of genetic sources for this resistance within the .crop species. O her 
approaches to prevent or decrease the effect of virus disease on plants are the use of chemicals or other 
methods which act against the virus vectors such as e.g. the use of insecticides, fungicides or good 
phytosanitary working conditions. However, the use of chemicals to combat virus disease by killing the 
vector is subject to increasing stricter regulations for use imposed by governments, confronting the growers 
with a decreasing scala of allowed chemical plant-protectants. 

Alternatively, a system called "cross-protection" can be employed. Cross-protection is a phenomenon 
in which infection of a plant with one strain of a virus protects the plant against superinfection with a second 
related virus strain. Thus, this latter method preferentially involves the use of avirulent virus strains to infect 
plants, to inhibit a secondary infection with a virulent strain of the same virus. However, the use of this 
natural cross-protection system has several disadvantages. The method is very laborious because it 
requires inoculation of every planted crop, and carries the risk that due to a mutation the former avirulent 
strain becomes a more virulent strain, thereby creating a disease by itself. It is also possible that an 
avirulent virus strain on one plant species acts as a virulent strain on another plant species. 

Several studies indicated that the viral coat protein of the protecting virus plays an important role in the 
cross-protection and that protection occurs when the resident virus and the challenging virus have the same 
or a closely related coat protein structures. 

With the recent development of genetic manipulation and plant transformation systems new methods to 
create virus resistance have emerged. Genetically engineered cross-protection Is a form of virus resistance 
which phenotypically resembles the natural cross-protection, but is achieved through expression of the 
genetic Information of the viral coat protein from the genome of a genetically manipulated plant The first 
successful experiments generating virus resistance by genetic engineering were performed by Beachy et 
al (1985) and Abel et al. (1986). They showed that expression of the tobacco mosaic virus strain U1 (TMV- 
U1) coat protein gene from the genome of a transgenic plant resulted in a delay of symptom development 
after infection with any TMV strain. Similar results with respect to coat protein-mediated protection have 
been obtained for alfalfa mosaic virus (AMV). potato virus X (PVX) and cucumber mosaic virus (CMV). 

Although TMV, CMV, AMV and PVX belong to different virus groups, they share a common architec- 
ture: in all these virions the viral RNA is a positive strand RNA encapsidated by a viral coat consisting of 
many individual but identical viral coat proteins. 

Tospoviruses are essentially different from these plant virions. The genus of tospoviruses belongs to the 
Bunyaviridae. All tospoviruses are thrips transmitt d. The virus particles are spherically shaped (80-120 nm 
in diameter) and contain Internal nucleocapsids surrounded by a lipid envelope studded with glycoprotein 
surface projections. The multipartite gen m consists of linear single stranded RNA molecules of negative 
or ambisense polarity. The terminal nucleotides of these RNA molecules are characterised by a consensus 

sequ nee as follows: 5 AGAGCAAUX. GAUUGCUCU 3 . wh rein X is C or U. Typical members 

of th tospoviruses ar tomato spotted wilt virus (TSWV) and Impatiens necrotic spot virus, also known as 
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Th TSWV virion contains 4 distinct structural proteins: an internal nucleocapsid protein N of 29 kd and 
two membrane glycoproteins: G, of approximately 78 kd and G 2 of approximately 58 kd. In addition, minor 
amounts of a larg protein L of approximately 260 kd have been detected in virus particles. The genome of 
TSWV consists of three linear single stranded RNA molecules of ±2900 nt (S RNA). ±5000 n (M RNA) and 
±7500 nt (L RNA). each tightly associated with nucleocapsid proteins and a few copies of the L protein to 
form circular nucleocapsids. A schematic structure outlining most properties of the TSWV vmon is given in 
figure 1. Based on these and other properties TSWV has been classified as a representative of the 
tospoviruses group. Moreover. TSWV is considered as the typemember of the tospoviruses. Only cir- 
cumstantial evidence was presented that suggested that a M RNA encoded gene is directly or ^indirecUy 
involved in the synthesis of the 6, membrane glycoprotein (Verkle.j and Peters. 1983). The coding 
properties of the other RNA molecules and the polarity of the genomic RNA were st.ll unknown pnor to this 

inV9 As°stated earlier, tospoviruses such as TSWV are transmitted by certain thrips species The vectors of 
TSWV belong to the family of Tripidae and include tobacco thrips (Frankliniella teca (Hinds.)), western 
flower thrips (F.occldentalls (Pergande)). common blossom thrips (F. Scnutee/ (Trybom) chilh hnps 
(Sclrtothrlps dorsalls (Hood)). Thrips setosus (Moulton), onion thrips (T. tabaci (Undeman)). F.mtonsa 
and T. palml. Virus is acquired by the thrips only during their larval stages. Urvae can transmit the virus 
before they pupate but adults more commonly transmit the virus. Adults can remain infective throughout 

20 theirMe span.^ ^^.^ ^ J$VA/ a(| ^ kneels makes it one of the most widely distributed 

plant viruses. The virus is widespread in temperate, subtropical and tropical climate zones throughout the 
world. At least 370 plant species representing 50 plant families, both mono- and dicotyledons, are naturalfy 
infected. The TSWV seriously affects the production of food and ornamental crops. Infections of plants with 
TSWV strains result generally in e.g. stunting, ringspots. dark purple-brown sunken spots, stem browning, 
flower breaking, necrotic and pigmental lesions and patterns, yellows and non-necrotic mottle mosaic in 
greens or even total plant death. Most hosts only exhibit a part of these symptoms. The wide range of 
symptoms produced by TSWV has complicated the diagnosis and led to individual diseases being given 
several different names. Also. TSWV symptoms within the same plant species may vary depending on the 
age of the plant, time of infection during the life-cycle of the plant, nutritional levels and environmental 
conditions, especially temperature. 

Although TSWV has been known for many years, is widely distributed, and causes economically 
important diseases in crops and ornamentals, limited progress has been made to identify sources for TSWV 
resistance genes. A mutagenic TSWV tolerance has been identified in Lycoperslcon peruvlanum ,M ft* 
resistance has not been transferred yet to cultivated tomatoes nor has a resistance source been identified 
for other crop species. The use of natural cross-protection systems to decrease the damage by severe 
TSWV strains is hot well documented. Umited positive results have been reported for tomato and lettuce. 

Therefore, the introduction of genetic information conferring resistance to tospovirus infection into plan 
gene pools by means of genetic manipulation provides breeder and grower a new method to combat 
40 tospoviral diseases. 
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Detailed Description 

45 The present invention provides recombinant DNA constructs comprising a DNA sequence coding for 
transcription into 

a) an RNA sequence of tospoviruses or an RNA sequence homologous thereto. 

b) an RNA sequence according to a) encoding for a tospovirus protein in which one or more cottons 
have been replaced by their synonyms (i.e. codons corresponding to the same amino acid or term.nat.on 

so signal), or a part thereof; or an RNA sequence homologous thereto, or 

c) an RNA sequence complementary to an RNA sequence according to a) or b). which DNA is under 
expression control of a promoter functioning in plants and has a terminator functional in plants. 

The DNA sequenc s defined under a), b) and c) hereinabove, are. hereinafter, for convenience referred 
to as a -TSWV R lated DNA Sequences". A TSWV Related DNA Sequence according totte invention may 
55 b modified, if desired, to create mutants or modified sequences homologous to a TSWV delated DNA 
Sequence from which they are derived using methods known to those skilled in th art such as site-directed 
mutag nesis. Such mutants or modified coding sequences are therefore within the spirit and the scope of 



this invention. 
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The term RNA sequence of a tospovlrus refers to a sequence of the S. M or L RNA strand, in particular 
of the S or L RNA strand, preferable of the S RNA strand of a tospovirus. 

Th term RNA sequenc homologous to an RNA sequence of a tospovirus refers to a RNA sequence of 
a tospovirus wherein a number of nucleotides have been deleted and/or added but is still capable of 
s hybridization to a nucleotide sequence complementary to an RNA sequence of a tospovirus under 
appropriate hybridization conditions. For the purpose of the invention appropriate hybridization conditions 
conveniently include an incubation for 16 hours at 42* C. in a buffer system comprising 5 x standard saline 
citrate (SSC). 0.5% sodium dodecylsulphate (SDS). 5 x Denhardt's solution. 50% formamide and 100 ug/ml 
carrier DNA (hereinafter the buffer system), followed by washing 3 times with a buffer compnsing 1 x SSC 
io and 0.1 % SDS at 65* C for approximately one hour each time. 

Preferred hybridization conditions for the purpose of the invention involve incubation in the buffer 
system for 16 hours at 49* C and washing 3 times with a buffer comprising 0.1 x SSC and 0.1% SDS at 
55* C for approximately one hour each time. Most preferred hybridization conditions for the purpose of the 
invention involve incubation in the buffer system for 16 hours at 55 C and washing 3 times with a buffer 
75 comprising 0.1 x SSC and 0.1 % SDS at 65 * C for approximately one hour each time. 

The length of the TSWV Related DNA Sequence will i.a. depend on the particular strategy to be 
followed, as will become apparent from the description hereinafter. In general, it will be desirable that the 
TSWV Related DNA Sequence comprises at least 20. suitably 50 or more nucleotides. 

The term promoter as used herein refers to the nucleotide sequence upstream from the transcriptional 
20 start site and containing all the regulatory regions required for transcription including the region coding for 
the leader sequence of mRNA (which leader sequence comprises the ribosomal binding site and initiates 
translation at the AUG start codon). : 

Examples of promoters suitable for use in DNA constructs of the invention include viral, fungal, 
bacterial, animal and plant derived promoters functioning in plant cells. The promoter may express the DNA 
25 consBtutively or differentially. Suitable examples of promoters differentially regulating DNA expression are 
promoters inducible by disease vectors, such as thrips. e.g. so-called wound inducible promoters. It will be 
appreciated that the promoter employed should give rise to the expression of a TSWV Related DNA 
Sequence at a rate sufficient to produce the amount of RNA necessary to decrease the tospovirus 
susceptibility of the transformed plant. The necessary amount of RNA to be transcribed may vary with the 
30 type of plant involved. Particularly preferred promoters include the cauliflower mosaic virus 35S (CaMV 
35S) promoter, derivatives thereof, and a promoter inducible after wounding by a disease vector such as 
thrips. e.g. a wound inducible promoter. 

The term terminator as used herein refers to a DNA sequence at the end of a transcnptional unit that 
signals termination of transcription. Terminators are DNA 3 -non-translated sequences that contain a 
as polyadenylation signal, that causes the addition of polyadenylate sequences to the 3 -end of the pnmary 
transcript. Terminators active in plant cells are known and described in the literature. They may be isolated 
from for example bacteria, fungi, viruses, animals and plants. Examples of terminators particularly suitable 
for use in the DNA constructs of the invention include the nopaline synthase terminator of A. tumefaaens. 
the 35S terminator of CaMV and the zein terminator from Zea mays. 
40 According to the terminology employed in the present specification, an RNA sequence is complemen- 
tary to another RNA sequence if it is able to form a hydrogen-bonded complex with it. according to rules of 
base pairing under appropriate hybridization conditions (as defined hereinabove). 

The invention also provides a vector capable of introducing the DNA construct of the invention into 
plants and methods of producing such vectors. ■ 
4S The term vector employed herein refers to a vehicle by means of which DNA fragments can be 
incorporated in a host organism. ■■ 

The term plants is used herein in a wide sense and refers to differentiated plants as well as 
undifferentiated plant material such as protoplasts, plant cells, seeds, plantlets etc. that under appropnate 
conditions can develop into mature plants, the progeny thereof and parts thereof such as cuttings and fruits 
so of such plants. 

The invention funhcr provides plants comprising in their genome a DNA construct of the invention, and 
methods of producing such plants. 

The plants according to the invention have reduced susceptibility to diseases induced by tospoviruses 
and do not have the disadvantages and limitations of plants obtained by the classical methods as discussed 
55 her inabov . 

Examples of plants susceptible to tospoviruses such as TSWV include but are not limited to Ageratum. 
alfalfa. Amaranthus. Anthirrhinum. Aquilegia. aubergine, beet. Begonia, broad bean, broccoli, bruss Is 
sprouts, cabbage, cauliflower, celery, chicory. Chrysanthemum. Cineraria, clover, Cosmos, cowpea, cu- 
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cumber, cyclamen. Dahlia. Datura. Delphinium, endive. Gerbera. Gladiolus. Glox.n,a. gourd, groundnut. 
Hippeastrum. Impatiens. lettuce, melon. Mesembryanthemum. onion, papaya, pea. peanut, pepper, petunia, 
pineapple, potato. Primula. Saint Paulia. safflower. Salpiglbssis. snap bean, soybean, spinach, squash, 
sugarbeet, sunflower. Tagetes. tobacco, tomato, Verbena. Vinca. watermelon. Zinnia. The invention relates 
in particular to these listed plants comprising in their plant genome a DNA construct of the invention. 

Since TSWV is the typemember of the tospoviruses, the particular features of tospoviruses are 
hereinafter illustrated employing TSWV as an example. „ „„, . -^..oonn 

The S M and L RNA are single stranded RNA molecules. The S RNA is approximately 2.900 
nucleotides long and comprises two genes, one encoding a non-structural protein (NSs) in viral sense, the 
other one encoding the nucleocapsid protein (N) in viral complementary f sense. The .ntergenic region 
between the NSs- and N-gene can be folded into a hairpin structure. The 5 -and 3 -terminal sequences of 
the S RNA are capable of hybridizing to each other such that the first nucleotide is opposite (and 
complementary) to the last nucleotide of said S RNA strand. We designate hereinafter the double-stranded 
structure obtained by hybridizing both RNA termini for convenience "pan-handle". 

The M RNA strand has approximately 5000 nucleotides. It contains one long open reading frame in viral 
complementary sense. This open reading frame is translated on polysomes located on the endoplasmic 
reticulum where the nascent polypeptide chain is cleaved co-translationally to form the spike proteins G, 
and G 2 respectively. Similar to the S RNA the terming of the M RNA strand are complementary to each 
other and may likewise hybridize to form a "pan-handle". , 

The L RNA strand has approximately 7500 nucleotides and contains one open reading frame in the viral 
complementary sense. This open reading frame most probably corresponds with the gene encoding | the 
viral transcriptase with an estimated molecular weight of approximately 260 kd In some mutant strains 
shortened L RNA molecules have been found in addition to the wild type, full-length L RNA These 
shortened L RNAs however, do always possess the characteristic terminal nucleotide sequences and thus 
are capable of forming "pan-handle" structures. They are also encapsidated with nucleocapsid protein and 
included in virus particles. Their presence supresses symptom development resulting in less severe 
detrimental effects. Hence, these shortened L RNA molecules can be regarded as defective mterfenng (Dl) 

RNAs. a term known to those skilled in the art. w 

A preferred embodiment of the invention relates to DNA constructs of the invention coding or 
transcription into tospovirus-RNA sequences of a "pan-handle", or into tospovirus-RNA sequences homolo- 

9 ° U Al!otheTpreferred embodiment of the invention relates to DNA constructs of the invention coding for 
transcription into tospovirus-RNA sequences of an open reading frame in viral complementary sense (.. e. 
having negative polarity), or into corresponding RNA sequences in which one or more codons have been 
35 replaced by their synonyms, or into RNA sequences homologous thereto. 

A further preferred embodiment of the invention relates to DNA constructs of the invention coding for 
transcription into tospovirus-RNA sequences of a hairpin, or into RNA sequences homologous thereto. 

Preferably the tospovirus-RNA sequence referred to hereinabove have at least 20. more preferably at 

40 lMS £rtes 0 if d DNA constructs suitable for use according to the invention include TSWV Related DNA 
Sequences coding for transcription into (reference is made to Fig. 4 for S RNA nucleotide sequences, figure 
6A for M RNA nucleotide sequences and figure 6B for L RNA nucleotide sequences): 
i) the S RNA nucleotide sequence 1 to 2915; 
i!) the S RNA nucleotide sequence 89 to 1483; 
45 iii) the S RNA hairpin; 

iv) the S RNA "pan-handle"; 

v) the S RNA nucleotide sequence 2763 to 1 987; 

vi) the L RNA nucleotide sequence 4462 to 1; 

vii) the L RNA nucleotide sequence 41 to 2; 

so viii) the L RNA nucleotide sequence 3980 to 46; . ,.. . 

ix) the L RNA nucleotide sequence (6706 + n) to (4462 + n); whereby n is the number of nucleotides of 
the gap between nucleotide 4462 (U) and the subsequently identified nucleotide (C) as shown in Rg. 6B. 

x) the L RNA nucleotide sequence (6706 + n) to (6016 + n); wherein n is as defined hereinabove; 

xi) the L RNA "pan-handle"; 

ss xii) an RNA sequence complementary to the S RNA nucleotide sequence 1987 to 2763; 
xiii) an RNA s quence complementary to the S RNA nucleotide sequence 89 to 1483; 
xv) an RNA s quence complem ntary to th L RNA nucleotid sequ nc 1 to 4462; 
xv) an RNA sequence complementary to the L RNA nucleotid sequence 2 to 41 ; 
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xvi) an RNA sequence complem ntary to the L RNA nucleotide sequence 46 to 3980; 

xvii) an RNA sequenc complementary to the L RNA nucleotide sequence (4462 + n) to (6706 + n) t 
wherein n is as defined above; ^ . 
xvill) an RNA sequence complementary to the L RNA nucleotide sequence (6016 + n) to (6706 + n), 

wherein n is as defined above; 

xix) S RNA nucleotide sequence 89 to 1483 in which one or more codons have been replaced by their 

synonyms; . , ..... 

xx) S RNA nucleotide sequence 2763 to 1987 in which one or more codons have been replaced by their 

synonyms; , , .... 

xxi) L RNA nucleotide sequence 3980 to 236 in which one or more codons have been replaced by their 

synonyms; 

xxii) L RNA nucleotide sequence 4462 to 236 in which one or more codons have been replaced by their 

synonyms; . 

xxiii) L RNA nucleotide sequence (6672 + n) to (4462 + n) in which one or more codons have been 

replaced by their synonyms; 

xxiv) L RNA nucleotide sequence (6672 + n) to (6016 + n) in which one or more codons have been 
replaced by their synonyms; 

xxv) the M RNA nucleotide sequence (m - 574) to m, wherein m is the total number of nucleotides of the 

mrna; u i 

xxvi) an RNA sequence complementary to the M RNA nucleotide sequence (m - 574) to m. wherein m is 
as defined hereinabove; 

xxvii) RNA sequences homologous ot the nucleotide sequences defined under i) to xvm). xxv) or xxvi) 

hereinabove; MM _ ■ 

xxviii) the M RNA nucleotide sequence from (m - 26) to (m - 574) in which one or more of the codons 

have been replaced by their synonyms. 

xxix) fragments of sequences defined under i) to xxviii) hereinabove. 

Preferred TSWV Related DNA Sequences code for transcription into the RNA sequences according to 
ii), iii). iv), v), viii), ix) or xi) as defined above, or into RNA sequences homologous thereto, or into fragments 
thereof comprising at least 20, more preferably at least 50 nucleotides. 

According to another preferred embodiment of the invention the DNA constructs of the invention 
comprise TSWV Related DNA Sequences coding for transcription into a combination of the 5 and 3 
terminal sequences of the viral S, M or L RNA respectively, more preferably of the S or L RNA, in particular 
of the S RNA. 

The invention further provides probes to diagnose suspected plants for infection with tospoviruses such 
as TSWV. Such probes comprise a labeled oligonucleotide (RNA or DNA) sequence complementary to an 
RNA sequence of tospoviruses such as TSWV (for the definition of the terms complementary see 
hereinbefore). The desired length of the sequence and appropriate method for diagnostic use of probes are 
known by those skilled in the art A suitable probe will conveniently comprise a sequence of at least 15 
preferably more than 30, more preferably from about 400 to 600 nucleotides complementary to an RNA 
sequence of tospoviruses such as TSWV. 

Probes according to the invention are useful, in that they allow a person skilled in the art to identify 
tospovirus RNA or parts thereof in Infected plant material, e.g. for diagnostic purposes prior to full 
expression of the disease symptoms. m 

The invention accordingly also provides a diagnostic method of determining tospovirus infection in 
plants which comprises detecting tospovirus replicative forms employing the probes of the invention to dot- 
blot type assays. 

Probes according to the invention are useful, in that they allow a person skilled in the art to construct 
and to use chimeric genes comprising a DNA sequence corresponding to an RNA sequence of tospovirus. 

The DNA constructs of the invention may be obtained by insertion of the TSWV Related DNA 
Sequence in an appropriate expression vector, such that it is brought under expression control of a 
promoter functioning in plants and its transcription terminated by a terminator. 

The term appropriate expression vector as used herein refers to a vector containing a promoter and a 
terminator which function in plant cells. 

The insertion of the TSWV Related DNA Sequence into an appropriate expression vector may be 
carried out in a manner known per se. Suitable proc dur s are also illustrated by the xamples hereinafter. 

Ukewise th construction of appropriate expression vector may be carried out in a manner known per 

se. 

Th plants according to the invention may be obtained by 
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a) inserting into the g noma of a plant cell a DNA construct according to the invention; 

b) obtaining transformed c lis; and 

c) regenerating from the transformed cells genetically transformed plants. tn TS vW 

£S££JESl f^^or organized p-ant structures, the use of 
nS the use of laser systems liposomes, or viruses or pollen as transformat.on vectors and the like. 
^?Z?cSS52i^5 ^ monitored for expression of a TSWV Related DNA Sequence by 
mfi thTd S Imown , I ittTS including Northern analysis. Southern analysis. PCR techniques and/or .m- 
Inolodc^hn oues ^e plants of the invention show decreased susceptibiliry to TSWV in action as 
jSltS whereby said plants are exposed to TSWV preferentially at a concentration m the 
iS^£^SSmL» correlates linearly with the TSWV concentration in the mocukna 
^ods suitSle for TSWV inoculation are known in the art; they include mechanics, inoculation and. ,n 

^^ZS^^cm aiso be obtained by crossing of a obtained p.ant according to 
the l£^f£ESEn with" another p.ant to produce plants having in their p.ant genome a DNA 

^2£5S2^d by the fonowlng non-limitative examples and the attached figures. 
Rgure 1 gives an overview of the structure of the tomato spotted win .virus. 
Figure 2 gives a review of the cloning strategy employed for TSWV S RNA. 

oSe nonSniaural pnMn NSs (88-1483) Is oulllmd tfxm. thai ol tha nuolaooapsd prawn N (2763- 

FiL. s olvas a schematic review ol the con»tnic»on ol a suitabls expression vaclor (pZU-B). 

3£ J *I . -SS. r„1a» o, the conslrae«on tf a a*!* plasrald (pTSWV-N) oompnsing fh. 

„ "C TltTsTslneSo re*, o. Use conslreo«.n « a so,,** ^ (pTSWV-NSs, pomp** 

^ff."aS"S. o, « o, a pta. in—on ,ee* pTSVW-ma. a 

NSsAB a DNA construct according to the invention. ^xcxaa/ 
Figure .14 gives a schematic example of the construction of a plant transformation vector pTSWV- 
NSsmut BB. a DNA construct according to the invention. 
45 Figure 15 shows the -pan-handle" region of TSWV S RNA. 
Rgure 16 shows the hairpin region of TSWV S RNA 

of the hirpi Sure of S RNA or of RNA sequences homologous Jereto are «^<^*£ 
so ?5&3-1708 nucleotide sequence of S RNA. for the 1709-1835 nucleotide sequence of S RNA. for the 1583- 
835 nucleotide sequence of the S RNA or for a sequence homologous to such sequences. 

SuS exa^i of preferred TSWV Related DNA Sequences coding for transcription into a sequence 
Q f the Wh3 region of S RNA or of RNA sequences homologous thereto is the combination of the 
CenceTSg S* 1-70 and 2.850-2916 nucleotide s quence of S RNA. or for sequences 

M h ° m Se^iTof^fired TSWV Related DNA Sequences coding for transcription into a "pan- 
„ hi « ^in^Ti RNA or of RNA seou nces homologous th reto is the combination of sequences coding 
£2 fiS au^pTuSr I" h7£» £ «ho 5-end of the «* L RNA and th .as, 80. in 
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particular the last 65 nucleotides from the 3-end of the viral L RNA, or for sequences homologous to such 
sequenc s or derivatives thereof. . 
Other objects, features, advantages of the present invention will become apparent from the following 

examples. 

References are abbreviated to the first authors name, full references are listed later. 



Material and methods 

All TSWV RNA derived sequences presented in here are depicted as DNA sequences for the sole 
purpose of uniformity only. A person skilled in the art appreciates that this is done for convenience only. 

Cultivars of Nicotians tabacum and Petunia hybrida, used in plant transformation studies, are grown 
under standard greenhouse conditions. Axenic explant material is grown on standard MS media (Murashjge 
and Skoog, 1962) containing appropriate phytohormones and sucrose concentrations. 

E coli bacteria are grown on rotary shakers at 37* C in standard LB-medium. Agrobactenum 
tumefaciens strains are grown at 28' C in MinA medium supplemented with 0.1 % glucose (Ausubel et a/., 
1 987) 

In all cloning procedures the £ coii strain JM83, (P", A{lac-pro) t ara t rpsU 080. d/acZM15) is used as 
a recipient for recombinant plasmids. . 

Binary vectors are conjugated to Agrobacterium tumefaciens strain LBA 4404. a strain containing the 
Ti-ptasmid vir region, (Hoekema ef a/., 1983) in standard triparental matings using the E. coil HB101, 
containing the plasmid pRK2013 as a helper strain. (Figurski and Helinski. 1979). Appropriate Agrobac- 
terium tumefaciens recipients are selected on media containing rifampicin (50 ug/ml) and kanamycine (50 
Ug/ml) 

Cloning of fragments in the vectors pUC19 (Yanish-Perron et a/., 1985), pBluescript (Stratagene), 
P BIN19 (Bevan ef a/.. 1984) or derivatives, restriction enzyme analysis of DNA, transformation to E coii 
recipient strains, isolation of plasmid DNA on small as well as large scale, nick-translation, in vitro 
transcription, DNA sequencing, Southern blotting and DNA gel electrophoresis are performed according to 
standard procedures (Maniatis ef a/., 1982; Ausubel et at., 1987). 



Examples 

Example 1: Isolation of TSWV particles and the genetic material therein 

Tomato spotted wilt virus strain CPNH1, a Brazilian isolate from tomato, is maintained on tomato by 
grafting. Virus is purified from systemically infected Nicotians rustica leaves, after mechanical inoculation 
essentially as described by Tas ef a/. (1977). It is essential to maintain all material used in this isolation 
procedure at 4 * C. Twelve days after inoculation of 100 grams of infected leaves are harvested and ground 
5 - 10 seconds at low speed setting in 5 volumes extraction buffer (0.1 M NaHzPO*. 0.01 M Na 2 S0 3 pH 7) 
in a Waring blender. The suspension is filtered through cheesecloth and the filtrate is centrifuged for 10 
minutes at 16,000 x g. The resulting pellet Is resuspended in three volumes resuspensionbuffer (0.01 M 
NaH 2 PO*. 0.01 M Na 2 S0 3 . pH 7). The pellet is dissolved by stirring carefully at 4 C. After centrifugation 
for 10 minutes at 12,500 x g the pellet is discarded and the suprnatant is centrifuged again for 20 minutes 
at 50,000 x g. The pellet is resuspended in 0.2 volume of resuspensionbuffer (0.01 M NaHzPCU, 0.01 M 
Na 2 S0 3 , pH 7) and kept on ice for 30 minutes. Antiserum raised in rabbits against material from non- 
infected' Nicotiana rustica is added to the solution, carefully stirred for 1 hour and non-viral complexes are 
pelleted by 10 minutes centrifugation at 16.000 x g. The cleared supernatant is loaded on a linear 5 - 40 % 
sucrose gradient in resuspensionbuffer: 0.01 M NaH 2 PO*. 0.01 M Na 2 S0 3 . pH 7, and spun for 45 minutes 
at 95 000 x g. The opalescent band containing TSWV virions is carefully collected with a synnge and 
diluted 4 times with resuspensionbuffer. Washed virions are pelleted by centrifugation for 1.5 hours at 
21 000 x g and resuspended in one volume resuspensionbuffer. Generally, 100 grams of leaf matenal yields 
approximately 0.5 mg of TSWV virions. TSWV RNA is recov red preferentially from purified virus 
pr parations by SDS-phenol extractions followed by ethanol precipitation. From 1 mg TSWV 1-5 ug of RNA 
is extracted. The intactness of the isolated RNA molecules is analysed by electrophoresis on an agarose 
gel. Three distinct RNA molecules are identified with apparent sizes of 2900 nucleotides (S RNA). 5000 
nucleotides (M RNA) and 7500 nucleotides (L RNA) r spectively. 
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Example 2: Sequence determination of the 3'-termini of the TSWV viral RNAs 

For direct RNA sequencing TSWV RNA is extracted from purifi d nucleocapsids essentially according 
to Verkieij et a/. (1983). Twelve days after inoculation 100 grams of infected I aves are harv sted and 
grounds 5-10 seconds at low speed setting in four volumes TAS-E buffer (00 M EDTA, 0.01 MMJ. 

01 % cysteine, 0.1 M TRIS pH 8.0) in a Waring blender. The suspens.cn .s filtered through cheesecloth 
and centrifuged 10 minutes at 1.100 x g. Nucleocapsids are recovered from this supernatant by 30 mmutes 
centrifugation at 66.000 x g. The pellet is carefully resuspended in one volume TAS-R buffer (1 /. Nonidet 
NP-40 001 M EDTA 0 01 M ItfeSO.. 0.1 % cysteine. 0.01 M glycine. 0.01 M TRIS pH 7.9). The pellet .s 
dissolved by stirring carefully for 30 minutes at 4 *C. The supernatant is cleared by a 10 minutes 
centrifugation at 16.000 x g. Crude nucleocapsids are collected from the cleared supernatant by segmenta- 
tion through a 30 % sucrose cushion for 1 hour at 105.000 x g. The nucleocaps.d pellet is s resuspended in 
400 ul 0.01 M Na-citrate pH 6.5. layered on a 20 - 40 % sucrose (in 0.01 M Na-atrate pH 6.5 and spun for 

2 hours at 280.000 x g. The three different opalescent bands, respectively L. M and S nucleocaps.d, are 
collected separately. TSWV RNA is recovered preferentially from purified nucleoc a P s 'd ^ep^at.ons by 
SDS-phenol extractions followed by ethanol precipitation. Routinely 100 ug of RNA are obtained from 100 
grams of infected leaves. The 3-ends of the separate TSWV RNAs are labeled using RNA ligase and 5 - 
FPIpCp. These end-labeled RNA molecules are separated on a low gelling temperature agarose gel 
(Wieslander. 1979). The enzymatic approach described by Clerx-Van Haaster and Bishop (1980) and Clerx: 
Van Haaster ef a/. (1982) is used to determine the 30 terminal nucleotides of the 3 - and 5 -ends of both S 

^stntheUc oligonucleotides complementary to the 3'-termini are synthesized using a commercially 
available system (Applied Biosystems) and used for dideoxy-sequencing with reverse transcnptase. 



Example 3: cDNA cloning of TSWV genetic material 

Oligonucleotides complementary to the 3'-ends of the S and L RNA are used for priming first strand 
cDNA synthesis. With these primers, double stranded DNA to TSWV RNA is synthes, zed I in pnnop e 
according to Gubler and Hoffman (1983). Two different approaches are used to generate cDNA .clones I to 
the TSWV viral RNAs. A first series of clones is obtained by random pnminfi I of ^the TSWV RNA , using 
fragmented single stranded calf thymus DNA. followed by first and second A 
made blunt-ended using T4-DNA polymerase and ligated w,th T4 ligase into the ! ma s*e °f pUCI 
second series of TSWV cDNA clones is obtained by priming first strand °NA sy^sis ^th the 
oligonucleotides complementary to the 20 terminal nucleotides at the 3 -ends of the TSWV RNAs. After 
second strand synthesis, treatment with T4 DNA polymerase to create blunt ends ; Pto«horyM &oM 
Sers are ligated to the ends of the cDNAs essentially according to Huynh ef a/. 1985 . After restnchon ^ 
La cDNA molecules with EcoRI, the cDNA fragments are .igated in the EcoR. site of MM. g«0 £DNA 
clones from both series containing viral inserts are selected via colony *^^ t ^^J^^ 
to Grunstein and Hogneas (1975) using pp-labeled, randomly primed first s^"d «D N A as a P*UMi 
of overlapping cDNA clones are selected by Southern analysis followed by P^^^/.^ 9 : 
in order to construct restriction maps, based on cDNA derived sequences of the S (figure 2) and L RNA 
(figure 3). 

Example 4: Sequence determination of the TSWV S. M and L RNA 

in order to determine the sequence of the S RNA 4 selected I cDNA clones are «Wmd in M13mp18. 
resulting in the plasmids P S614. pS608. pS520 and pS514. as described above (flg. 2). These clonesara 
sequenced in both directions using the standard protocol of Yanish-Perron ef ^J^^T^S 
sequence of the 3'-end of the S RNA is determined by primer extens>on of the synthetic oligonucleotide SI 
(5 dfGAGCAATCGTGTCAATTTTG)). which is complementary to the 20 nucleotides of the 3 -terminus. A 
ecoTsX^oioLeotide 83 complementary to the nucleotides 30 to 50 from the 5 -end of me v^ 
8 i RNA is used to verify the 5-terminal sequence of the S RNA by primer extension. Sequenc data from 
the TSWV SRNA (2916 nt) ar summarized in figur 4. 

CoZuter simulated Lnslation of the 6 different reading frames on the viral strand , and I v,r* com- 
olementarv strand reveals the presence of two putative open r ading fram s (se figure 5A). On th vira 
S£ll!d VoZn wSw frame Is found starting at position 89 and terminating at an UAA stopcodon at 
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position 1483 possibly encoding a protein of 464 amino acids with a molecular weight of 52.4 kd. This 
protein is most likely a non-structural protein, tentatively designated NSs. The other open reading frame Is 
located on the viral complementary strand from position 2763 to 1987. encoding a 258 ammo acids long 
polypeptide with a molecular weight of 28.8 kd. This open r ading frame possibly encodes the viral 
nucleocapsid protein N. . 

To verify this assumption the amino acid composition of the purified nucleocapsid protein is determined 
and compared with the deduced composition from the sequence data. Also, the putative nucleocapsid 
encoding N protein gene is Inserted into pBluescript. Using T7 RNA polymerase the N protein gene is 
transcribed in vitro. This in vitro synthesized transcript is subsequently translated in an in vitro rabbit 
reticulocyte lysate (New England Nuclear: NEN) system In the presence of PS] labeled methionine (NEN). 
A protein with a molecular weight identical to the native nucleocapsid protein could be precipitated from the 
synthesized radioactive proteins, when antibodies raised against purified nucleocapsid protein are used, 
following procedures essentially as described (Van Grinsven ef a/. 1986). This indicates that the N protein 
gene encodes the viral nucleocapsid protein. Figure 5A shows the coding capacities of the viral and the 
viral complementary strand of the S RNA indicating the NSs and N protein gene respectively, both genes 
being expressed from subgenomic mRNAs. Thus, the unique situation occurs that a plant virus RNA has an 
ambisense gene arrangement Other important features of this S RNA sequence is the existence of 
complementary terminal repeats capable of forming so-called "pan-handle" structures. These structures 
play an important role in replication and transcription of the viral RNA. Another putative regulatory element 
is the hairpin stucture in the intergenic region of the S RNA which most likely contains the transcription 
termination signals for both subgenomic mRNAs. encoding respectively the N and NSs-protein. 

The nucleotide sequence of the TSWV M and L RNA was elucidated employing the same strategies 
and methods followed to determine the nucleotide sequence of the S RNA. Figure 3 shows the cloning 
strategy for the L RNA derived, overlapping cDNA clones. Summarized sequence data of the TSWV M and 
L RNA are given in figure 6A and 6B respectively. Computer simulated translation suggests that this viral L 
RNA is of negative polarity containing one single open readinp frame which starts 34 nucleotides from the 
3'-end of the viral RNA and stops at nucleotide 236 from the 5 -end of the viral RNA (figure 5B). The mRNA 
(complementary to the viral L RNA) containing this open reading frame is most probably translated into the 
viral transcriptase. 



Example 5: Construction of an expression vector pZU-A 

The 35S cauliflower mosaic virus (CaMV) promoter fragment is Isolated from the recombinant plasmid 
pZ027 a derivative of pUC19 carrying as a 444 bp Hindlll-Pstl fragment the Hincll-Hphl region of the 35S 
promoter of CaMV strain Cabb-S (Franck et ai.. 1980). The nucleotide sequences of CaMV strains are very 
similar for the different strains. The 35S promoter fragment is excised from pZ027 as a 472 bp EcoRI-Pstl 
fragment which contains: a part of the polylinker region. 437 bp of the non-transcribed region, the 
transcription initiation site and 7 bp of the non-translated leader region but not containing any 35S 
translations initiators. This 35S promoter fragment is ligated using T4 ligase into EcoRI-Pstl lineanzed 
pZO008 This plasmid pZO008 carries the nopaline synthase (NOS) terminator as a 270 bp Pstl-Hindlll 
fragment The resulting recombinant plasmid pZU-A carries the 35S promoter, a unique Pstl site and the 
NOS terminator (figure 7). The sequence of the used 35S promoter in the plant expression vector pZU-A is 
as follows: 

1 AAGCTTCTAG AGATCCGTCA ACATGGTGGA GCACGACACT CTCGTCTACT 
51 CCAAGAATAT CAAAGATACA GTCTCAGAAG ACCAAAGGGC TATTGAGACT 
101 TTTCAACAAA GGGTAATATC CGGAAACCTC CTCGGATTCC ATTGCCCAGC 
151 TATCTGTCAC TTCATCAAAA GGACAGTAGA AAAGGAAGGT GGCACCTACA 
201 AATGCCATCA TTGCGATAAA GGAAAGGCTA TCGTTCAAGA TGCCTCTGCC 
251 GACAGTGGTC CCAAAGATGG ACCCCCACCC ACGAGGAGCA TCGTGGAAAA 
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301 AGAAGACGTT CCAACCACGT CTTCAAAGCA AGTGGATTGA TGTGATATCT 
351 CCACTGACGT AAGGGATGAC GCACAATCCC ACTATCCTTC GCAAGACCCT 
401 TCCTCTATAT AAGGAAGTTC ATTTCATTGG AGAGGACCCT GCAG 

The complete sequence from the used NOS terminator in the vector pZU-A is as follows: 

1 CTGCAGATCG TTCAAACATT TGGCAATAAA GTTTCTTAAG ATTGAATCCT 
51 GTTGCCGGTC TTGCGATGAT TATCATATAA TTTCTGTTGA ATTACGTTAA 
101 GCATGTAATA ATTAACATGT AATGCATGAC GTTATTTATG AGATGGGTTT 
151 TTATGATTAG AGTCCCGCAA TTATACATTT AATACGCGAT AGAAAACAAA 
201 ATATAGCGCG CAACCTAGGA TAAATTATCG CGCGCGGTGT CATCTATGTT 
251 ACTAGATCTC TAGAAAGCTT 



Example 6: Construction of an expression vector pZU-B 

The recombinant plasmid pZ0347 is a derivative of pBluescript carrying a 496 bp BamHI-Smal 
fragment containing a 426 bp 35S promoter fragment (Hindi fragment) of CaMV strain Cabb-S. linked to a 
67 bp fragment of the non-translated leader region, the so-called fl-region, of the tobacco mosaic v.rus. This 
results in a chimeric promoter with a complete transcriptional fusion between the promoter of CaMV to the 
untranslated leader of TMV. By using in vitro mutagenesis the original position of the TMV ATG startcodon 

is mutated to a Smal site. , 

The plasmid pZO008 carries the nopaline synthase (NOS) terminator as a 260 bp Pstl-H.ndlll fragment. 
This Pstl-Hindlll fragment is excised from pZOOOS and ligated using T4 ligase into Pstl-Hindlll linearized 
pZ0347. The resulting recombinant plasmid pZU-B is another plant expression vector. The sequence of this 
35S-Q promoter as used in the plant expression vector pZU-B is as follows: 

1 GGATCCGGAA CATGGTGGAG CACGACACGC TTGTCTACTC CAAAAATATC 
51 AAAGATACAG TCTCAGAAGA CCAAAGGGCA ATTGAGACTT TTCAACAAAG 
101 TTATTGTGAA GATAGTGGAA AAGGAAGGTG GCTCCTACAA ATGCCATCAT 
151 TGCGATAAAG GAAAGGCCAT CGTTGAAGAT GCCTCTGCCG ACAGTGGTCC 
201 CAAAGATGGA CCCCCACCCA CGAGGAGCAT CGTGGAAAAA GAAGACGTTC 
251 CAACCACGTC TTCAAAGCAA GTGGATTGAT GTGATATCTC CACTGACGTA 
301 AGGGATGACG CACAATCCCA CTATCCTTCG CAAGACCCTT CCTCTATATA 



351 AGGAAGTTCA TTTCATTTGG AGAGGACTTT TTACAACAAT TACCAACAAC 
401 AACAAACAAC AAACAACATT ACAATTACTA TTTACAATTA CCCGGG 

The resulting recombinant plasmid pZU-B contains the 35S Hincll-TMV 0 fusion (35S-Q). unique Smal 
and Pstl sites and the NOS terminator (figur 8). This expression vector is preferentially used in 
constructing tran'slational fusions of the gene to be expressed downstream of the chimeric promoter 35S-Q. 



Example 7: Subcloning of the TSWV-N protein gene 
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The TSWV-N protein coding sequ nee is obtained by fusion of the cDNA clones pS614 and pS520 (see 
figur 2) The cDNA clone pS520 is subjected to EcoRI-Hindlll double-digestion and the fragment 
containing the 3'-end of the TSWV-N protein gene Is separated electrophoretically and purified from the gel 
using a DEAE membrane (NA-45. Schleicher and SchUII) and cloned In th EcoRI-Hindlll lineariz d 
pBluescript (Stratagene) resulting in the recombinant plasmid pS520E/H. The 5 -end containing fragment of 
the TSWV-N protein is excised from pS614 by a EcoRI digestion. This fragment is separated elec- 
trophoretically and purified from the gel using a DEAE membrane (NA-45, Schleicher an c I ScMJJj and 
cloned in the EcoRI linearized pS520E/H resulting in the recombinant plasmid pTSWV-N3. The TSWV-N 
gene containing plasmid pTSWV-N3 is linearized by digestion with Pvull. Pstl linkers 5 d(CCTGCAGG) are 
ligated with T4 ligase to the blunt ends of the linear DNA. Subsequently, the DNA is digested wrth Pstl and 
the fragment containing the TSWV-N protein sequence is separated electrophoretically and excised from 
the gel The TSWV-N protein gene containing fragment is ligated into a Pstl linearized vector such as 
pBluescipt (Stratagene) to yield the recombinant plasmid pTSWV-N (figure 9). This addition of restriction 
sites facilitates the construction of further plasmids. (Alternatively, one may choose to add the sites in 
different ways such as but not limited to site-directed mutagenesis or by ligation of other synthetic 
oligonucleotide linkers. These methods are all known to a person skilled in the art) 

Example 8: Subcloning of the TSWV non-structural protein gene (NSs-gene) of the TSWV S RNA 

The sequence of the non-structural protein NSs is isolated from the cDNA clone pS514. The NSs- 
protein gene is located on an EcoRI fragment After restriction of the cDNA clone pS514 with EcoRI and 
treatment with T4 DNA polymerase to create blunt ends. The NSs gene containing fragment is separated 
electrophoretically on an agarose gel and excised from the gel. To this blunt ended fragment containing the 
NSs-protein gene synthetic Pstl linkers (5' d(CCTGCAGG)-3 ) are ligated using T4 polymerase. After 
restriction with Pstl, the NSs-protein gene containing fragment is ligated in an Pstl linearized pBluescript to 
yield the recombinant plasmid pTSWV-NSs (figure 10). 



30 Example 9: Construction of plant transformation vectors containing TSWV sequences 



Example 9A: N-protein gene constructions in pZU-A 

35 In order to create a plant transformation vector containing the N protein gene driven by the 35S 
promoter and terminated by the NOS terminator, the Pstl fragment of pTSWV-N is isolated and inserted into 
Pstl linearized pZU-A thereby creating the chimeric gene cassette vector pTSWV-NA. The cassette 
containing the 35S promoter, the N-protein gene and the NOS terminator is excised from pTSWV-NA by 
restriction With Xbal and ligated in the unique Xbal site of pBIN19. a binary transformation vector developed 

40 by Bevan et al (1984). The resulting plasmid pTSWV-NAB (figure 11) is used in plant transformation 
experiments using methods well known to a person skilled in the art 
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Example 9B: N-protein constructions in pZU-B 

In order to make a fusion in which the ATG start codon from the N protein gene is fused directly to the 
3'-end of the TMV untranslated leader of the 35S-Q promoter the startcodon of the N gene has to be 
mutated. Using site-directed mutagenesis, the sequence 5 d(ACGATCATCATG TCT) in pTSWV-N is 
mutated to 5 d(ACGATATCATG TCT). thereby creating an EcoRV site: 5 d(GATjATC) just proximal to the 
ATG startcodon of the N geneTThe resulting recombinant plasmid is called pTSWV-Nmut The mutated N 
protein gene is excised from this plasmid via an EcoRV-Pstl digestion. This fragment is isolated and 
inserted into the Smal-Pstl linearized pZU-B. resulting in recombinant plasmid pTSWV-NmutB. The 
chimeric cassette containing th 35S-Q promoter, the mutated N gene and the NOS terminator is excised 
from th plasmid pTSWV-NmutB via a BamHI/Xbal digestion. The isolated chimeric gene cassett is then 
inserted into the BamHI/Xbal linearized pBIN19 to create the binary transformation vector pTSWV-NmutBB. 
The r suiting plasmid pTSWV-NmutBB (figure 12) is used In plant transformation experiments using 
methods w II known to a p rson skilled in the art 
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Example 9C: NSs-protein gene constructions in pZU-A 

In order to create a plant transformation vector containing the NSs-protein gene driven by th 355 
promot r and terminated by the NOS terminator, the Pstl fragment of pTSWV-NSs is isolated and inserted 
into Pstl linearized pZU-B to create the chimeric gene cassette vector pTSWV-NsA The cassette containing 
the 35S promoter, the NSs-protein gene and the NOS terminator is excised from pTSWV-NsA by restriction 
with EcoRI and Xbal and ligated into EcoRI-Xbal linearized pBIN19. The resulting plasmid pTSWV-NsAB 
(figure 13) is used in plant transformation experiments using methods well known to a person skilled in the 
art. 

Example 9 D: NSs-protein gene constructions in pZU-B 

In order to create a fusion in which the ATG start codon from the NSs-protein is fused directly to the 3'- 
end of the TMV leader of the 35S-Q promoter the startcodon of the NSs gene has to be mutated. Using 
site-directed mutagenesis, a procedure known to a person skilled in the art. the sequence 5 ji- 
(AACCATAATG TCT) is mutated to 5' d(AACCATATQ TCT), thereby creating a Ndel site: 5 d(CA|TATG) 
that includeilhe ATG startcodon of the NSs-protein gene. The resulting plasmid is called pTSWV-NSsmut. 
The plasmid pTSWV-NSsmut is linearized with Ndel. followed by a treatment with T4 DNA polymerase to 
create blunt ends. This linearized DNA is digested with Pstl and the fragment containing the mutated NSs 
gene is isolated and inserted into Smal Pstl linearized pZU-B resulting in the recombinant plasmid pTSWV- 
NsmutB. The chimeric cassette containing the 35S-0 promoter, the mutated NSs-protein gene and the NOS 
terminator is excised from the plasmid pTSWV-NsmutB via a BamHI/Xbal digestion. The isolated chimeric 
gene cassette is then inserted into the BamHI/Xbal linearized pBIN19 to create the binary transformation 
vector pTSWV-NsmutBB. The resulting plasmid pTSVW-NsmutBB (figure 14) is used in plant transforma- 
tion experiments using methods well known to a person skilled in the art. 
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Example 9E: 5'- and 3' -termini "pan-handle" constructions in pZU-A and pZU-B 

A DNA analysis programme is used to locate the "pan-handle" loop in the viral, TSWV S RNA. The 
strongest "pan-handle" loop that is detected includes the first 70 nucleotides at the 5 -end (1 to 70) of the 
viral S RNA and the last 67 nucleotides at the 3'-end (2850 to 2916) of the viral S RNA (figure 15). The 
DNA sequence containing this "pan-handle" loop in the viral S RNA is as follows: 

1 AGAGCAATTG TGTCAGAATT TTGTTCATAA TCAAACCTCA CTTAGAAAAT 
51 CACAATACTG TAATAAGAAC 



and 



2850 GTTCTTAATG TGATGATTTG TAAGACTGAG TGTTAAGGTA TGAACACAAA 
« 2900 ATTGACACGA TTGCTCT 

A DNA analysis programme is used to locate the "pan-handle" loop in the viral TSWV L RNA. A strong 
"pan-handle" loop that is detected includes the first 80 nucleotides at the 5 -end (1 to 80) of the viral L RNA 
so and the last 80 nucleotides at the 3'-end of the viral L RNA The DNA sequence containing this "pan- 
handle" loop in the viral L RNA is as follows: 



ss 
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1 AGAGCAATCA GGTACAACTA AAACATATAA CCTCTCCACA GCCAGACTTT 
51 ACAAATTACA TAAGAATTCC CTCCAGTGAA 

and 

AAAGTGGTTC CATTTTCTAT TAATTTTTGT ATTTTCTGGA TGTTCATGTT 
TGCTTAAAAT CGTTGTTACC TGATTGCTCT 

These regions are synthesized on a commercial DNA synthesizer and appropriate linker sequences are 
added. Construction of the "pan-handle" vectors of S and L RNA results in respectively: pTSWV-termS and 
pTSWV-termL Using appropriate restriction enzyme combination these fragments are inserted between the 
CaMV 35S promoter and the NOS terminator of pZU-A or between the 35S-Q promoter and the NOS 
terminator of pZU-B yielding the chimeric cassettes: pTSWV-termSA, pTSWV-termLA, pTSWV-termSB and 
pTSWV-termLB. These cassettes are then transferred into the binary transformation vector pBlN19 using 
appropriate enzyme combinations yielding the following plasmids: pTSWV-termSAB, pTSVW-termLAB, 
pTSWV-termSBB and pTSWV-termLBB. Alternatively, it is possible to design "pan-handle" constucts 
including the 3'- and 5'-end termini that are larger as indicated above, or separated by any other DNA 
sequence in order to enhance the stability of the transcripts produced from these recombinant genes in 
plants. All "pan-handle 11 constructs resemble shortened tospovirus RNA, respectively TSWV RNA molecules 
and therefore can be regarded as defective interfering RNAs. 



Example 9F: Construction containing TSWV S RNA hairpin region in pZU-A and pZU-B. 

A DNA analysis programme is used to locate the hairpin loop in the viral TSWV S RNA. The strongest 
hairpin loop that is detected starts at nucleotide 1592 and ends at nucleotide 1834 (figure 16). The 
sequence containing this hairpin loop is as follows: 

1 TAGTAGAAAC CATAAAAACA AAAAATAAAA ATGAAAATAA AATTAAAATA 
51 AAATAAAATC AAAAAATGAA ATAAAAACAA CAAAAAATTA AAAAACGAAA 
101 AACCAAAAAG ACCCGAAAGG GACCAATTTG GCCAAATTTG GGTTTTGTTT 
151 TTGTTTTTTG TTTTTTGTTT TTTATTTTTT ATTTTATTTT TATTTTATTT 
201 TATTTTTATT TTATTTTTAT TTTATTTATT TTTTGTTTTC GTTGTTTTTG 
251 TTA 

A Hindtll fragment of 526 bp carrying the hairpin region is isolated from pS514. This fragment is 
excised from a agarose gel and subsequently treated with T4 polymerase to create blunt ends. In the 
following step Psti linkers are ligated to these blunt ends. After digestion with Pstl the fragment is cloned in 
Pstl linearized pZU-A, resulting in the recombinant plasmid pTSWV-HpSA. The plasmid pTSWV-HpSA is 
digested with Hindlll and the fragment containing the chimeric gene is excised from an agarose gel and 
ligated into Hindlll linearized pBIN19, resulting in the transformation vector pTSWV-HpSAB. 

Alternatively, the Hindlll fragment of pS514 is treated with T4 DNA polymerase to create blunt ends and 
is subsequently cloned in the Smal site of the expression vector pZU-B, resulting in the recombinant 
plasmid pTSWV-HpSB. The plasmid pTSWV-HpSB is digested with Hindlll and the fragment containing the 
chimeric gene is excised from an agarose gel and ligated into Xbal linearized pBIN19, resulting in the 
transformation vector pTSWV-HpSBB. 

(It is clear to a person skilled in the art that also other fragments can be isolated from the cDNA clones 
of the TSWV S RNA containing the hairpin region as described above without int rferenc with th function. 
Also, a fragment containing the hairpin region may be synthesized using a DNA-synthesizer.) 
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Example 10: Transformation of binary vectors to plant material 

Methods to transfer binary vectors to plant material are well established and known to a person skilled 
in the art Variations in procedures exist due to for instance differences in used Agrobacterlum strains, 
s different sources of explant material, differences in regeneration systems depending on as well the cultivar 
as the plant species used. 

The binary plant transformation vectors as described above are used in plant transformation expen- 
ments according to the following procedures. The constructed binary vector is transferred by tn-parental 
mating to an acceptor Agrobacterlum tumefaciens strain, followed by southern analys.s of the ex- 

,o conjugants for verification of proper transfer of the construct to the acceptor strain, inoculation and 
cocultivation of axenic explant material with the Agrobacterlum tumefaciens strain of choice, selector 
killing of the Agrobacterium tumefaciens strain used with appropriate antibiotics, selection of transformed 
cells by growing on selective media containing kanamycine. transfer of tissue to shoot-inducing media, 
transfer of selected shoots to root inducing media, transfer of plantlets to soil, assaying for intactness of the 

is construct by southern analyses of isolated total DNA from the transgenic plant, assaying for proper function 
of the inserted chimeric gene by northern analysis and/or enzyme assays and western blot analysis of 
proteins. 

20 Example 11: Expression of TSWV RNA sequences in plant cells 

RNA is extracted from leaves of regenerated plants using the following protocol. Grind 200 mg 
leafmaterial to a fine powder in liquid nitrogen. Add 800 Ul RNA extraction buffer (100 mM Tr.s-HCI (pH 
8 0) 500 mM NaCI 2 mM EDTA. 200 mM 0-Mercapto-ethanol. 0.4% SDS) and extract the homogenate with 

as phenol, collect the nucleic acids by alcohol precipitation. Resuspend the nucleic acids in 0.5 ml 10 mM 
Tris-HCI (pH 8 0) 1 mM EDTA. add LiCI to a final concentration of 2 M. leave on ice for maximal 4 hours 
and collect the RNA by centrifugation. Resuspend in 400 ul 10 mM Tris-HCI (pH 8.0), 1 mM EDTA and 
precipitate with alcohol, finally resuspend in 50 ul 10 mM Tris-HCI (pH 8.0), 1 mM EDTA. RNAs are 
separated on glyoxal/agarose gels and blotted to Genescreen as described by van Gnnsven et «M1988). 

30 TSWV viral RNA is detected using DNA or RNA probes labeled with pP], P S S] or by using nonradioactive 
labeling techniques. Based on these northern analysis, it is determined to what extent the regenerated 
plants express the chimeric TSWV genes. . 

Plants transformed with chimeric constructs containing a TSWV protein gene are also subjected to 
western blot analysis. Proteins are extracted from leaves of transformed plants by grinding in sample buffer 

35 according to Laemmli (1970). A 50 ug portion of protein is subjected to electrophoresis in a 12.5 h SDS- 
polyacrylamide gel essentially as described by Laemmli (1970). Separated proteins are transferred to 
nitrocellulose electrophoretically as described by Towbin ef a/. (1979). Transferred proteins are reacted with 
antiserum raised against purified TSWV nucleocapsids according to Towbin ef a/. (1979). Based on the 
results of the western analysis, it is determined that transformed plants do contain TSWV proteins encoded 

40 by the inserted chimeric genes. 

Example 12: Resistance of plants against TSWV infections 

Transformed plants are grown in the greenhouse under standard quarantine conditions in order to 
prevent any infections by pathogens. The transformants are self-pollinated and the seeds harvested. 
Progeny plants are analyzed for segregation of the inserted gene and subsequently infected wrth TSWV by 
mechanical inoculation. Tissue from plants systemically infected with TSWV is ground in 5 volumes of ice- 
cold inoculation buffer (10 mM phosphate buffer supplemented with 1% Na 2 S0 3 ) and rubbed m the 
presence of carborundum powder on the first two fully extended leafs of approximately 5 weeks old 
seedlings. Inoculated plants are monitored for symptom development during 3 weeks after inoculation. 

Plants containing TSWV Related DNA Sequences show reduced susceptibility to TSWV infection as 
examplified by a delay in symptom development, whereas untransformed control plants show severe 
systemic TSWV symptoms within 7 days after inoculation. 

Example 13: Use of synthetic oligonucleotides for diagnostic purposes 
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RNA is xtracted from leaves of suspected plants using the following protocol: grind 1 gram of I af 
material, preferentially showing disease symptoms, in 3 ml 100 mM Tris-HCI. 50 mM EDTA, 1.5 M NaCI 
and 2% CTAB (pH 8.0). After grinding, 1 ml of the homogenate is subjected to chloroform extraction and 
incubated at 65 *C for 10 minutes. The inorganic phas is then collected and extracted with 
phenol/chloroform (1:1), followed by a last extraction with chloroform. The ribonucleic acids are isolated 
from the inorganic phase, containing the total nucleic acids, by adding UCI to a final concentration of 2 M. 
The preparation is incubated at 4 # C for 1 hour, after which the ribonucleic acids are collected by 
centrifugation. The ribonucleic acid pellet is resuspended in 25 ul 10 mM Tris-HCI, 1 mM EDTA (pH 8.0). 
The ribonucleic acids are recovered by standard alcohol precipitation. The ribonucleic acid pellet is 
resuspended in 25 ul 10 mM Tris-HCI. 1 mM EDTA (pH 8.0). 

1 julI of the purified ribonucleic acids is spotted on a nylon blotting membrane (e.g. Hybond-N, 
Amersham UK). The presence of TSWV in the plant is detected by standard hybridization, using any part or 
parts of the sequence isolated from virions or preferentially by designing synthetic oligomers on the basis 
of disclosed sequence information as a probe. (Alternatively, in vitro transcripts of regions of the TSWV 
genome are used to detect TSWV Related RNA Sequences in diseased plants.) A diseased plant is 
diagnosed by the occurrence of hybridization at the dot containing RNA material from the diseased plant. 

Using procedures as described herein RNA is isolated from 12 pepper plants (A1 - D3) suspected to be 
virus infected selected from a field. Similarly, RNA is extracted from 3 TSWV inoculated tobacco plants (E1 
- E3) and from 3 non-inoculated tobacco plants (F1 -F3). From each plant an RNA sample of 1 III is spotted 
onto a Hybond membrane. This filter is hybridized under standard conditions with an in vitro transcript 
synthesized from the cDNA clone pS614 using T3 polymerase.and a-p 2 P]UTP as an radioactive label. 
Control non-infected plants (F1 - F3) do not show a signal, control TSWV infected plants (E1 - E3) do show 
a strong signal, whereas suspected pepper plants all show signals ranging in intensity from weak to strong 
(see figure 17). 
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Claims 



1 . Recombinant DNA constructs comprising a DNA sequence coding for transcription into 
a) an RNA sequence of tospoviruses or an RNA sequence homologous thereto; 

b an RNA sequence according to a) encoding for a tospovirus protein in wh,ch one or more ^codons have 
been replaced by their synonyms (i.e. codons corresponding to the same am.no ac.d or term.nat.on s.gnal). 

or a part thereof: or an RNA sequence homologous there to, or 

c) an RNA sequence complementary to an RNA sequence according to a) or b), 

which DNA is under expression control of a promoter and a terminator functioning in plants. 

2. The DNA constructs of Claim 1, wherein the DNA sequences code for transcnption into: 

i) the S RNA nucleotide sequence 1 to 2915; 

ii) the S RNA nucleotide sequence 89 to 1 483; 

iii) the S RNA hairpin; 

iv) the S RNA "pan-handle"; 

v) the S RNA nucleotide sequence 2763 to 1987; 

vi) L RNA nucleotide sequence 4462 to 1; 

vii) the L RNA nucleotide sequence 41 to 2; 

viii) the L RNA nucleotide sequence 3980 to 46; 

ix the L RNA nucleotide sequence (6706 * n) to (4462 ♦ n); whereby n is the number of nucleoUdes of 
the gap between nucleotide 4462 (U) and the subsequently identified nucleot.de (C) as i shown , in Fig. 6B. 

x) the L RNA nucleotide sequence (6706 + n) to (6016 + n); wherein n is as defined here,nabove; 

xi) the L RNA "pan-handle"; 

xii) an RNA sequence complementary to the S RNA nucleotide sequence 1987 to 2763; 

xiii) an RNA sequence complementary to the S RNA nucleotide sequence 89 to 1483; 

xiv) an RNA sequence complementary to the L RNA nucleotide sequence 1 to 4462; 

xv) an RNA sequence complementary to the L RNA nucleotide sequence 2 to 41 i ; 
J) an RNA sequence complementary to the L RNA nucleotide sequence^ 46 to 3980; 

xvii) an sequence complementary to the L RNA nucleotide sequence (4462 + n) to (6706 - n). wherein 

x^ii)^ d R f NA d sequence complementary to the L RNA nucleotide sequence (6016 + n) to (6706 + n), 

SJ? RNA^ a ucSsXence 89 to 1493 in which one or more codons have been replaced by their 

«)TrNA nucleotide sequence 2763 to 1987 in which one or more codons have been replaced by their 

xx!)°L RNA nucleotide sequence 3980 to 236 in which one or more codons have been replaced by their 

ShJTrnA nucleotid s quence 4462 to 236 in which one or mor codons have been replaced by their 

Xiii)? RNA nucl otid sequenc (6672 + n) to (4462 + n) in which one or more codons hav been 
replaced by their synonyms, wherein n is as defined above; 

Z) L RNA nucleotide sequenc (6672 + n) to (6016 + n) in which on or mor codons have be n 
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replaced by their synonyms, wher in n is as defined above; 

xxv) the M RNA nucleotide sequence (m-574) to m, wherein m is the total number of nucleotides of the 

MRNA; u . 

xxvi) an RNA sequence complementary to the M RNA nucleotide sequence (m-574) to m, wherein m is 

as defined hereinabove; 

xxvii) RNA sequences homologous to the nucleotide sequences defined under i) to xvui), xxv) or xxvi) 
hereinabove; . 

xxviii) the M RNA nucleotide sequence from (m-26) to (m-574) in which one or more of the codons have 
been replaced by their synonyms, wherein m is as defined above; 

xxix) a fragment of a DNA sequence defined under i) to xxviii) hereinabove. 

3. The DNA construct of Claim 1, coding for transcription in tospovirus-RNA sequences of a pan-handle, or 
into RNA sequences homologous thereto. 

4. The DNA construct of Claim 1, coding for transcription into tospovirus-RNA sequences of an open 
reading frame in viral complementary sense, or into corresponding RNA sequences in which one or more 
codons have been replaced by their synonyms, or into RNA sequences homologous thereto. 

5. The DNA construct of Claim 1, coding for transcription into tospovirus-RNA sequences of a hairpin, or 
into RNA sequences homologous there to. 

6. The DNA construct of Claims 1 to 5, coding for transcription into tospovirus-RNA sequences, or into 
tospovirus-RNA sequences in which one or more codons have been replaced by their synonyms, or into 
RNA sequences homologous thereto of at least 20 nucleotides. 

7. The DNA construct of Claim 6, coding for transcription into tospovirus-RNA sequences, or into 
tospovirus-RNA sequences in which one or more codons have been replaced by their synonyms, or into 
RNA sequences homologous thereto of at least 50 nucleotides. 

8. The DNA construct of Claim 1, wherein the DNA sequence codes for transcription into a combination of 
the 5'n and 3' terminal sequences of the viral S or L RNA respectively. 

9. The DNA construct of Claims 1 to 8, wherein the promoter is a viral, fungal, bacterial, animal or plant 
derived promoter functioning in plant cells. 

10. The DNA construct of Claim 9. wherein the terminator is a viral, fungal, bacterial, animal or plant denved 
terminator functioning in plant cells. 

11. A plant comprising in its genome a DNA construct in accordance with Claims 1 to 10. 

12. A probe comprising a single or double stranded oligonucleotide sequence complementary to an RNA 
sequence of a tospovirus. . ' . ' 

13. A probe according to Claim 12, wherein the oligonucleotides are complementary to the tospovirus S 
RNA sequence, the tospovirus L RNA nucleotide sequence or to fragments of such sequences comprising 
at least 15 nucleotides. 

14. A probe according to Claims 12 or 13 wherein the oligonucleotide sequence has from 400 to eoo 
nucleotides. 

15. A process of preparing plants according to Claim 11, which comprises 

a) inserting into the genome of plant cell a DNA construct of Claim 1 , 

b) obtaining transformed cells; 

c) regeneration from the transformed cells genetically transformed plants. 
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THE STRUCTURE OF 
TOMATO SPOTTED WILT VIRUS 
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1 AGA GCA AUO GUG UCA GAA UUU OGU UCA UAA OCA AAC CUC ACU UAG AAA' AUC ACA ADA COG 

met aer aer aer val tyr glu ser lie lie gin 
61 UAA UAA GAA CAC AGU ACC AAU AAC CAU AAO GOC OOC AAG UGO UUA OGA GUC GAU CAU UCA 

thr arg ala aer val trp gly aer thr ala aer gly lya ala val val asp aer tyr trp 
121 GAC AAG AGC UUC AGO COG GGG AOC AAC OGC AUC OGG OAA AGC UGU OGU AGA UUC OUA CUG 

lie his glu leu gly thr gly ser gin leu val gin thr gin leu tyr ser asp ser arg 
181 GAU UCA OGA ACU UGG UAC UGG UUC UCA ACU AGU UCA GAC CCA GCU GUA UUC UGA UUC AAG 

ser lys val val leu trp leu tyt cys lys val gly lie phe pro val lys lys lys arg 
241 AAG CAA AGU AGU CCU UUG GCU AUA CUG CAA AGU AGG GAU CUU CCC UGU GAA GAA GAA GAG 

phe leu ser gin his val tyr ile pro lie phe asp aap lie asp phe aer lie aan lie 
301 AUU UCU UUC UCA GCA UGO GUA UAO CCC UAU UUU UGA UGA UAU UGA UUO UAG CAU CAA UAU 

asp asn ser val leu ala leu ser val cys aer asn thr val asn ala asn gly val lys 
361 UGA UAA COC UGU OCO GGC ACU AOC UGO UUG COC AAA OAC AGO CAA OGC OAA CGG AGO GAA 

his gin gly his leu lya val leu aer pro ala gin leu hia aer ile glu aer He met 
421 ACA UCA AGG UCA UUU GAA GGO UUU GUC UCC UGC CCA GCU CCA CUC UAU UGA AOC UAU CAU 

aan arg aer aap ile thr aap arg phe gin leu gin glu lya aap ile ile pro asn asp 
481 GAA CAG AUC UGA UAU OAC AGA CCG AUU CCA GCU CCA AGA AAA AGA CAU AAU UCC CAA UGA 

lys tyr ile glu ala ala aan lya gly aer leu aer cya val lya glu hia thr tyr lya 
541 CAA AUA CAU UGA AGC UGC AAA CAA AGG CUC UUU GUC UUG UGU CAA AGA GCA UAC CUA UAA 

ile glu met cys tyr aan gin ala leu gly lya val aan val leu aer pro aan arg aan 
601 GAU CGA GAU GUG CUA UAA UCA GGC UUU AGG CAA AGU GAA UGU UCU AUC UCC UAA CAG AAA 

val hia glu trp leu tyr aer phe lys pro aan phe aan gin val glu aer aan aan arg 
661 UGU CCA UGA AUG GCU GUA CAG UUU CAA GCC AAA UUU CAA UCA AGU UGA AAG CAA CAA CAG 

thr val aan aer leu ala val lya aer leu leu met aer ala glu aan aan ile met pro 
721 AAC UGU AAA UUC UCO UGC AGU GAA AUC UCU GCU CAU GUC AGC AGA AAA CAA CAU CAU GCC 

asn ser gin ala ser thr asp ser his phe lys leu ser leu trp leu arg val pro lys 
781 UAA CUC UCA AGC UUC CAC UGA UUC UCA OOO CAA GCO GAG CCO COG GCU AAG GGU UCC AAA 

val leu lys gin val ser ile gin lys leu phe lys val ala gly asp glu thr aan lys 
841 GGU UUU GAA GCA GGO UUC CAU UCA GAA AUO GOO CAA GGO UGC AGG AGA OGA AAC AAA CAA 

thr phe tyr leu ser ile ala cys ile pro asn his asn ser val glu thr ala leu asn 
901 AAC AUU OUA UUU AUC UAU UGC CUG CAU UCC AAA CCA UAA CAG UGU UGA GAC AGC UUU AAA 

ile thr val ile cys lya his gin leu pro ile arg lys cys lys ala pro phe glu leu 
961 CAO UAC UGO 0A0 UUG CAA GCA UCA GCO CCC AAU UCG CAA AUG CAA AGC UCC UUU OGA AOU 

ser met met phe ser asp leu lys glu pro tyr asn ile val his aap pro ser tyr pro 
1021 AUC AAU GAU GUU UOC OGA 000 AAA GGA GCC UUA CAA CAO 0G0 UCA UGA CCC UUC AUA CCC 

lys gly aer val pro met leu trp leu glu thr hia thr aer leu hia lya phe phe ala 
1081 CAA AGG AUC GGU UCC AAU GCU COG GCO CGA AAC UCA CAC AUC UUO GCA CAA GUU CUU UGC 

thr asn leu gin glu asp val ile ile tyr thr leu asn asn leu glu leu thr pro gly 
1141 AAC UAA CUO GCA AGA AGA UGO AAU CAO CUA CAC 000 GAA CAA CCO UGA GCO AAC OCC OGG 

lys leu asp leu gly glu arg thr leu asn tyr ser glu asp ala tyr lys arg lya tyr 
1201 AAA GOO AGA 000 AGG OGA AAG AAC COO GAA OUA CAG UGA AGA UGC CUA CAA AAG GAA AUA 

phe leu ser lys thr leu glu cys leu pro ser asn thr gin thr met ser tyr leu asp 
1261 UUO CCO OUC AAA AAC ACU UGA AUG OCO OCC AOC OAA CAC ACA AAC 0A0 GOC UUA CUU AGA 

aer He gin ile pro aer trp lys ile asp phe ala arg gly " e * v »fi* " r 
1321 CAG CAO CCA AAO CCC OUC AUG GAA GAU AGA CUO UGC CAG AGG AGA AAO UAA AAO UUC UCC 

Figure 4 
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gin aer lie aer val ala lys ser Xeu leu lys leu asp leu aer gly lie lya lya lys 
1381 ACA ADC UAU UUC AGO OGC AAA AUC UUU GUU AAA GCO UGA OOU AAG CGG GAO CAA AAA GAA 

glu ser lya val lya glu ala tyr ala ser gly ser lys OCH 
1441 AGA AUC OAA GGU UAA GGA AGC GUA UGC UUC AGG AUC .AAA AUA AUC UUG CUU UGU CCA GCO 

1501 UUU UCU AAU UAU GUU AUG UUU AUU UUC UUU CUU UAC UUA UAA UUA UUU CUC UGU UUG UCA 

1561 UCU CUU UCA AAU UCC UCC UGU CUA GUA GAA ACC AUA AAA ACA AAA AAU AAA AAU GAA AAU 

1621 AAA AUU AAA AUA AAA UAA AAU CAA AAA AUG AAA UAA AAA CAA CAA AAA AUU AAA AAA CGA 

.1681 AAA ACC AAA AAG ACC CGA AAG GGA CCA AUU UGG CCA AAU UUG GGU UUU GUU UUU GUU UUU 

1741 UGU UUU OUG UUU UUU AUU UUU UAU UUU AUU UUU AUU UUA UUU UAU UUU UAU UUU AUU UUU 

1801 AUU UUA UUU AUU UUU UGU UUU CGU UGU UUU UGU UAU UUU AUU AUU UAU UAA GCA CAA CAC 

1861 ACA GAA AGC AAA CUU UAA UUA AAC ACA CUU AUU UAA AAU UUA ACA CAC UAA GCA AGC ACA 

1921 AGC AAU AAA GAU AAA GAA AGC UUU AUA UAU UUA UAG GCU UUU UUA UAA UUU AAC UUA CAG 

1981 CUG CUU UCA AGC AAG UUC UGC GAG UUU UGC CUG CUU UUU AAC CCC GAA CAU UUC AUA GAA 
OPA ala leu glu ala leu lys ala gin lys lys val gly phe met glu tyr phe 

2041 CUU GUU AAG AGU UUC ACU GUA AUG UUC CAU AGC AAC ACU CCC UUU AGC AUU AGG AUU GCU 
lys asn leu thr glu ser tyr his glu met ala val ser gly lya ala asn pro aan aer 

2101 GGA GCU AAG UAU AGC AGC AUA CUC UUU CCC CUU CUU CAC CUG AUC UUC AUU CAU UUC AAA 
aer aer leu lie ala ala tyr glu lys gly lys lys val gin asp glu asn met glu phe 

2161 UGC UUU GCU UUU CAG CAC AGU GCA AAC UUU UCC UAA GGC UUC CUU GGU GUC AUA CUU CUU 
ala lys ser lys leu val thr cys val lys gly leu ala glu lys thr asp tyr lys lya 

2221 UGG GUC GAU CCC GAG GUC CUU GUA UUU UGC AUC CUG AUA UAU AGC CAA GAC AAC ACU GAU 
pro asp lie gly leu aap lya tyr lya ala asp gin tyr lie ala leu val val aer lie 

2281 CAU CUC AAA GCU AUC AAC UGA AGC AAU AAG AGG UAA GCU ACC UCC CAG CAU UAU GGC AAG 
met glu phe aer aap val aer ala lie leu pro leu aer gly gly leu met lie ala leu 

2341 UCU CAC AGA CUU UGC AUC AUC GAG AGG UAA UCC AUA GGC UUG AAU CAA AGG AUG GGA AGC 
arg val aer lya ala aap asp leu pro leu gly tyr ala gin lie leu pro his ser ala 

2401 AAU CUU AGA UUU GAU AGU AUU GAG AUU CUC AGA AUU CCC AGU UUC UUC AAC AAG CCU GAC 
lie lys ser lya ile thr aan leu aan glu aer aan gly thr glu glu val leu arg val 

2461 CCU GAU CAA GCU AUC AAG CCU CCU GAA GGU CAU GUC AGU GCC UCC AAU CCU GUC UGA AGU 
arg ile leu aer asp leu arg arg phe thr met aap thr gly gly ile arg aap aer thr 

2521 UUU CUU UAU GGU AAU UUU ACC AAA AGU AAA AUC GCU UUG CGU AAU AAC CUU CAU UAU GCU 
lya lya ile thr ile lya gly phe thr phe aap aer gin lya ile val lya met ile ser 

2581 CUG ACG AUU CUU UAG GAA UGU CAG ACA UGA AAU AAC GCU CAU CUU CUU GAU CUG GUC GAU 
gin arg aan lya leu phe thr leu cya aer ile val aer met lya lya ile gin aap ile 

2641 GUU UUC CAG ACA AAA AGU CUU GAA GUU GAA UGC UAC CAG AUU CUG AUC UUC CUC AAA CUC 
aan glu leu cya phe thr lya phe aan phe ala val leu aan gin aap glu glu phe glu 

2701 AAG GUC UUU GCC UUG UGU CAA CAA AGC AAC AAU GCU UUC CUU AGU GAG CUU AAC CUU AGA 
leu aap lya gly gin thr leu leu ala val ile aer glu lya thr leu lya val lya aer 

2761 CAU GAU GAU CGU AAA AGU UGU UAU AUG CUU UGA CCG UAU GUA ACU CAA GGU GCG AAA GUG 
met 

2821 CAA CUC UGU AUC CCG CAG UCG UUU CUU AGG UUC UUA AUG UGA UGA UUU GUA AGA CUG AGU 
2881 GUU AAG GUA UGA ACA CAA AAU UGA CAC GAO UGC UCU 

Figure 4 
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1 AGAGCAAUC • 

CAUAGA UCAUCAAAUA 

UGCOAAAOGC CUGCAAUAGA GAGGAAUAAU 
ACUCUACUUC UGAUGAAGAA GCADAACUCU 
UGAAAUGAGC UUCAAUGC0U COAAACAACA 
UCAGCAUUUU AUCAACCUCU CCACUGGCUA 
CGAUCCAUCU AGUUUGGAAA UCACAGDUUU 
UCAUUGCACG AGAGUGAUAA ACGGAAGACA 
UGCUAACAGG GUAUGUGUUU GUCGUCGAGC 
UUUGCCCAAA AGUUUCUUGA GCUUUUCUUA 
CCUUCUUCAA UUUAACAUAC AUUAGAAUCU 



ACUUAUAUGC GAAGAUUCUG UGCAACAGUU 
CGCUCCAUUU ACUAAUUCUU UCAGAUAUAA 
UUUUUGGGUU CAAAGUUAUG CAAAAACUUU 
UUUCUGGCAG AGAl5GAGCUU GAAAAGUCUG 
UUAAUGAUGU UGCAUUAUCA UCAGAGUGCA 
UGUAACAUUC CAAAGUCUUG UGAUAAGCUU 
GAUAAUUUAA AUUGCCUUGC AACCAAUUCA 
CUUUGGUCGA AAGUCCUAUG GCAGUUUCAU 
GAUUCAAAAA AAUAUCUGUU GGUAUACAAA 
UGAUGUAUUG AUUGCUCU (app.nt5000) 



Figure 6A 
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1 AGAGCAAUCA GGOACAACUA AAACAUAUAA 
61 UAAGAAUUCC CUCCAGUGAA ACUAUACCGG 
121 GUXJCACCAAA UUGUUUUAGG CUAAGUAGAU 
181 AUAUUCUAAG UUAUAUCUAA UCUGUUAUUC 
241 UCUGUGUCUU CUUCUUCAUC AAGCUCAUCU 
301 UAACUAAGCA AGAAAUUGUC UUCAUCUGUU 
361 CUCAAAAUAA UCUCUCUUGU UUGCAUGAUA 
421 CCAUACAAUG UUXJUUUCUGA UAAACAUAGU 
481 UUAUAGACCU CUAGOUUUUU AGUUCGCGAU 
541 UUAACUCUUC CAGACAAGAA AUGUUCCUUG 
601 UCUAAGCUGU CAGCUAUCUC AUUUAAAGGA 
661 GUUGCAUAAA CUUUGAGUGC AGGUACACUG 
721 AAUAUGUCUG GUCUAAAACU GUUCCGUAAC 
781 AUUCCUCCAA UCAAGAACUU UAUCAACUCA 
841 AACCCUUUCU UUUUCCUUUU AAUUUUCAUU 
901 UCAUCUAGCC UCAACACAUU AAAACUGGCA 
961 AUUUUUAGAU UCUCUCCAGA UAUAACAAAC 
1021 UCUGAUAAUU UUAUUUCUGG UAAAACCAGC 
1081 UUGUUUUGCA AGUCCAGAUC AUGAUCCAAU 
1141 CUGCUAAGAA CAUUCCUUAA ACUUGCUAAG 
1201 UCACAAGCUU UAAUCAAUUG GAUCAWJCCU 
1261 UUUUUUGCCA UUAGCUCAAC AGCAUGCAUU 
1321 GCAUCAUUUA UUAGAGAAUA UUUAAGCAUG 
1381 CUAGUAUACU GUUUUGCUUU GUCAACAAUC 
1441 AAGUUGUCAU GUGUUCUUAG CAAGUUUUCG 
1501 UCAUCAAGAU AGCUAUCAUA UUCUUCCACA 
1561 GUAUCCCGGA UGUUCAOUAA GAAUUCAUCC 
1621 UCUUUGAUCU GCUGUGAUAA GUUUCCAAAG 
1681 CUGUUUUUUA UGGUUUUGAU UGUUUCAGUU 
1741 AAAAAUUCCC CUACUUCAGG GAGCCUUAAG 
1801 CAUACAUCGG AAUUCACAUU UUCAUAUUUU 
1861 CUUAUUUGUC UUGAUGUAGC AUUAACCUUU 
1921 AUGCAAGAUC UCCUGGUUCU AAGAAAACAA 
1981 UUAUCAAACA UAAUUUCUGC CAAUAAUUUC 
2041 UACUCUCUUU CAUUCAAUAU CUCAAAUGGU 
2101 GAUGCUAUCA AUUCCACAUA GCAGCUUCUU 
2161 GAGACAUAAU UUCCUGAUUC AUCUUGUUCU 
2221 CUAAUAUACC UGAUAACAAA AUCUAAUGUU 
2281 CAGACAAWJU, GUUUCUGGAU UGUGUUCAAU 
2341 ACUGUGAAGC CACUUGCUGU UGAAUUGUUA 
2401 CAUAUAGGUU UACOGUCAAG GCAAACAUUU 
2461 AUUGUUGGGA AAAGUGACOG CAAUUUAUCU 
2521 UCUAAGUUCA UAUCUUGCUG AGCACUUACC 
2581 CUAGCUGUAG AAACAAAAUG AUAGGAUUCU 
2641 AAUGOGUUUG AGAUAAACCC UCUCCCUAGA 
2701 CUAGUAAAGC AUGUGUUAAU UGCUAAAAGC 
2761 GCUUUCACCA AUCUUUGACA UUCAUUAGAA 
2821 UUGCCAUUAA CUAGGGAUUU UACAUCUUCC 
2881 AAAACAUUAA UACUAUCGUC UGUUAGAGGG 
2941 UUUGAUCCAU AGACAGCAUG CAUUACUACU 
3001 CAUCUCUCAG UCUGAAACCA CCUGAAAUUG 
3061 UAUGCAGAAU CUAAUAUUGG AAUCAGCAUG 
3121 ACACAUGUUU UAAAUGAACA AUAUGAAUCU 
3181 UUUUCCAUGU UUUCUUCAUA AGUCAUCUUC 
3241 GCUUUUGACC UUAACACUCU AUCUAAAGCA 
3301 UUUCUUUUAG AAGGGCUUGU GUACAUGAAC 
3361 AAUGGUGCUA UCAGAAUUGO UUCAGGAUUU 



CCUCUCCACA GCCAGACUUU ACAAAUUACA' 60 
GACUCUACAU UUAUAUACAC UAUAGAUUUG 120 
CUAAUCUAAU UCUAAAAUAG CAAUAAAUUU 180 
AOGGAUUGUU AGUCAUUACU UUCAAUUUAA 240 
UCAUCAAAGG CAUCUUCUUC CCCUCUUAGA 300 
UCAACAACCU CGUCCAACUC ACUUAUUUUA 360 
UAAUCAUUGA UAUCAGAGUA GGUAUAUCUU 420 
GGGUGCCUAG CAAGGAAAUU UGCGAUAUCC 480 
UGUUCAUCUC UUAAUUGUAU CAAAGCUUUU 540 
GACCUGCUUU UUGUCAGUUU UAGAUAACCU 600 
CAAUUGACAU UCUUAUAUUC CAUAUAAACA 660 
GAACUUAAUC UUCCCAAUCU GUCAGUUGAA 720 
AAUGUCUCUA UAUCAAAGCU GGUUCCCUUU 780 
CUUAGUAGAA GUGUGUUUGC UGGAAAUAAA 840 
UCUUUGAUCA AACCUUCAUA CAUUUCUUCC 900 
AGACCCACAA AUUCUUCAUC AGACCUAUCC 960 
CCAUUUUCUU UCAGUAUUUC CCUAGAUAOT 1020 
GUCUUGAAAU CAUAACUUUU UUCCAUUAAG 1080 
CUUAUUCUCC UCCCAAAUGU GGCAUAUGUC 1140 
UUUAAUAUAG AGUCAUUGUU GUGGCAUGUU 1200 
AACAAGUUGA AUUCCCCUAU CUCUAUGUGC 1260 
CCAGUGGCUC UAOAGCUUUU AAAUCCUGAU 1320 
AGCAGUAUUU CUUUGAAACC UUCUAAAUCU 1380 
UCUGAAAAAA GAGGACUAUA CUCAUUUUCA 1440 
AAUAGUUCUA CUGUGUCAUU AAAUCCAUUU 1500 
CUUUUGCAGU UUUCCAAACC UUCUAAGCAG 1560 
AUCGUUUCUC UGAUAUCAUC AUCUACAAUC 1620 
GCUUCUAUGA AAGUUAAAGA AGUCAUAAGC 1680 
UCAGAGUCUG CAGUUUUAUA CAUAUCAGAG 1740 
ACUAGAGAUU UUUGGCUUUC UAUAAUAUCC 1800 
AUCUUUACAA CUAAUUUUAC AUUCUCUAGU 1860 
ACAAUCAUUU UGUUGUCUGU GGUCAUUCUG 1920 
UCUGUGGUUG UUOGAUUUAU UAUGUUCACU 1980 
UCUAUACUUU CUCUGUAUGU GUCAAAAAGA 2040 
GUUCUCAAAG AAACUGCUAC ^AUCUUGAUCA 2100 
GUCAUGAUCC CUAUUUUGUA CAUAGUUCCA 2160 
UCAGUUUOUA CAUAUCUUAC AUCCCUUCUU 2220 
CUGUGAGAAC ACAAACCCAU GUUCCACAUC 2280 
UCACUGUUAU GUGAUAGAAU CACAUUUGAC 2340 
AUCUUUUUCA UUUUGUCUUC UAGACUCUGC 2400 
UGCAAUAUUA AAACDAUAUC CUCUCUUGUA 2460 
GAUAUGUAUA CAGCAGUGUC CAGUAUUUUC 2520 
ACAGUUCUGA UAGUUUUGUC UAAUUUAUAA 2580 
UCUUUACUGU AGAUGGUGGA UAGGAGAGCU 2640 
UUGAACUUAG AAUAGAAGGG AAAGCUUUUC 2700 
CUGUUUUGUG CUGCUAUCAU AUAAUUGCAU 2760 
UUGCUAAAAG CCUCUGUUUC GAAGCUGUCU 2820 
AUGAUUAAAC CUGUUCCAAA CGUGCOUGUU 2880 
AUGUUUUUUA AACCUAAGUU CUCUAUAUAA 2940 
AGUCCCGGAG AGCCUUCAAC AACAGGUAUC 3000 
UAGUUAUCUC UUUUCCUAGA UUCACAAGGG 3060 
CUGAAAUUAA CAUCCUUGAU AACUAGAUUU 3120 
AAAUCAAAUU UUAGCAUUUC UAAGAUUUUC 3180 
ACAGUGGAAG AAAUGUUUAC AAAUGUUCUA 3240 
AGUUUCUCUG UAGAUUGGUU CGAUAACUGG 3300 
AGCUGACUAA GUAAAAAAUC UCUAUCAUUC 3360 
UUUAUCAUUA UGUCAUGGAC AGAUUCCAUU 3420 



Figure 6B 
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3421 UCUAAGCUAG AUGCAAUCUU CtfUAUAUAAC 
3481 AUCUUUAAGU UCUUAGUGCO UUUAUAUAAA 
3541 UUUGAAAAAU CUUUAUAACU UGACAUCUUG 
36C1 AAUUUUAUAA UCUGAGUUAA* CAUAGUUUGA 
3661 'JCUUCAGACA CUGAUGCUUU CUCAAACAGA 
3721 UCWCCAGAQ UUCCUCUUUC AUACUUUUCU 
3781 CUGGAAGAAG AGACACUAUC UUWACOTCO 
3841 AOCACAUUGU AAUAGAUGAU UUGAUCAUUU 
3901 AOAGGAGAGG UCAACCAGCC CCCCAUCUUU 
3361 CCAAGOJUGU UAAAAAUUOJ AAUACUAUCA 
4021 CtfUAACGCUU GUACCUGCAC AGCCCCAUAA 
4081 UUUCUCAGAA GCAUUGUAAC AUGUAUACUG 
4141 UGCGAAGAUU CUGVGCAACA GUUUGO/AAA 
4201 UVUACAAUUC UUUCAGAUAU AAACOCUACU 
4261 UUCAAAGUUA UGCAAAAACU UUUGAAAUGA 
4321 AGAGAUGAGC UUGAAAAGUC UGUCAGCAUU 
4381 GUUGCAUUAU CAUCAGAGUG CACAAUCCAU 
4441 UCCAAAGUGU UGUGAUAAGC UU 



AUCAUGUCUU CAGAGUAAUG GUAAAUGGCU 
CAAGUCUCUA AUUCAGUCUU CAGUACUGAA 
GCAUAAUUAU UGAAAACAGA UGUGAUCUUA 
AGAAAAAAAG AGCCCUUAAA UCUUUUACCA 
AGUUUUUVCA UAUGAGUCAG AGAAGACAUG 
GCCUGCOJUU CAGAGGUAUC AUAOTCCAAG 
3UUUWAGAA WCUUTOCW AGAACUCAUU 
UAGGUAGUUU UGUVCUCCCU GCCUGUAUCV 
AAACACGAAC CUUGCUUAUU CCAUUCUUUA 
CCGGUGUWA UAUCUAUUUC AtfCUUCAAOT 
CWAGCUCUG CUAAACACAG UCUAACUAAG 
UUAUUUAUGC UACOAUAUGG GCUAUUAUAA 
CGAAACUUGU UAAUAAUUGC GUUGUUGUAA 
CUGGGGACCA AGUCAUGGAA AGUUAGUUW 
UACCUWCOO CAUAUUCAGC AGCAUUCUGG 
UUUUCTOUUC UCUUUUUUUC UCUAAUUCUA 
AGAGATOCCC AUAUOTGCOG CUUUUGGAUU 
UWGGCtTCUA TOUUUAAAUU GAUACAAGAG 
UGCUGGUCAU UCAUUGUUUC AAUGUGGAAA 
UCCUCUCUAG GGAUUUCAUC UGUAACAAGO 
UAGGCUACCU CAGUAGGAUV UCUUUCUUUG 
UCUUUAGAUC CAAAAUCAUC ACOTAAGAUtf 
UUUGCAUAAU AGUUUCUCAC CACUGUGGGU 
AUAUUOAGCA AUUUGGCAUU UVUAGCAUGC 
GAAUCUUCGA AUUUOJCCAA AACCCUGCUA 
GGUUUCAAUA UAACUAUAGA AACGAAAACA 
ACUUUUAUAU CUUUGAAAGA UUUCCAAAUA 
UCAGUCOJGG CAUCGACAGA UAUUUUCCAA 
UUUGAUUCUU UAUAUAUCAC AUAGWAOCA 
UUUAUUCUCU CAAACTCAUC AUCAGAGAGC 
UCCAGAUAUU CUCUUAGUGC AUGCUCUAUC 
CUCAGGUGCC UGGAGAUCAG CUCUCCAAAA 
UAUUUUUGCU CAAACAGAGA GACCAUCUCA 
CCUUUGUUUA GUCCUUCACC AUCAGCAGUG 
UAAUUUTOUG CAUUAUUAUU UAUAAUCACA 
UGUAAAUCCA AAGCUAGAUC GUGGUUAGAA 
GUGGUUCCAU UUUCUAUUAA UUUUUGUAUU 
UGUUACCUGA UUGCUCO (app. Ill 7300) 



UCAUCUUCCU CUAAAACAGC AUCCUCGUCU 3480 
UCUUCAUUCU UUUUUAAAUU AGGAUAAAGU 3540 
UUUAAAGCAO UCUCAUUUAU AAAAUUAGGU 3600 
AAUUUCAUCC CAAUUGUUAG AACGUCUGAA 3660 
UUGAUAAGAA AUAUCAUCUU UUUAUCUUUU 3720 
UUUAGCUCUC UGAAUCUCAU CUGUAGAUAA 3780 
UCUAAACUW UUUOGUUCAA AAAAUCUCUU 3840 
GAUGAUGGAC CUAAUAUAGA CAACGGCUCU 3900 
OTGGGAAUCU CGUWGACUU UAAACUUACU 3960 
VUCACUUCAC CAGGAAGCAU UGAAUAGAUG 4020 
GCAAAAGGUA UAACUUCAWJ AGGACAGCCTJ 4080 
AGUGACAUTJA GAUCAUCAAA AtXAACCUAUA 4140 
UGCCUGCAAU AGAGAGGAAU AAUCGCUCCA 4200 
UCUGAUGAAG AAGCAUAACU CUUUUUUGGG 426C 
GOJUCAAUGC UUO/AAACAA CAUUUCUGGC 4320 
UUAUCAACCU CUCCACUGGC UAUUAAUGAU 4380 
CUAGUUUGGA AAUCACAGUU UUUGUAACAC 4440 



GAAUUGC 

AGAUCAUUGC AUAAUUUGUV UAUCUUUUUG 
UCUUGAAUGU ACAAGCAUUC AGUAUUUUUA 
CCAUCCUCUA UUAUAUUAAA AUUAUCAAGU 
CUUAAGCUAU CAUCAtfUGUC AUGCCUUAUC 
UCUAUACUUG AGUAAUCAUC AUUUGGGUCA 
CCUWAGAGA UCCUWCWU UAUUUCUUCU 
CCVAUAWCA UUACUGCUAA GCCUUUAAAA 
UUCAUGCAAA AUUCAUUUUU GUUUCUGGUU 
GACUUUUUUG UUCUCUCAAC UVUGAUACUC 
UCCUGCUUUU CCAUUWCGU UGUAUCACAA 
UUUGUUAUAU CUCUUGCUW AUUUGUUUGG 
AUAACCCUUG AAGAACWW GAAGCUCUCC 
UCUACCUCUG AGCUCAAUUC UAUCUUCAAA 
AGUUCAGAGA GAUCAUUGAA ACAGUUAGGA 
UUAAAAGCUG UGGtJUAUGUC AGGUAUCCCt; 
AGCAUGAUCU UCUCAAGGUG UUUUCCAUAC 
AAGUGGUCUU UCAAAUCAGC UAAAUCAGCU 
UUUUGGUACC CUUUAAAUtJU GUCAGCUAUA 
CAGAAAUUUC CAAACAAAAA AUGOAUUAUU 
GAAUCAAUUA UUCUUUCAAU AUCUUCGGAA 
UAUUUGCUGG UCCAGUGACU GAAAAACACA 
CCUCUGACUU CAGAUUUGAU UCUUUGAGUG 
GUGGUUCCAA CUAUUUGCCC AACAUAAGAA 
UUUAUUCUUU CUCCGUGCAA UGCUGAUGGA 
CUAGUAACAG UGAUUGGCAU CCCAGCAAUA 
GGAUGCUCUU CCAGGAAUGG CUUUCCAOUC 
CUCUUCUAGG UAUUUCUCCA UUGUUUGGUU 
UCAUAAAUGA UUAAACAAAG CUCACUGUUU 
GGGGUGGCA0 UAGWGCUAC AUAUUCUUUA 
UUGUUAAUGC AGGACUUUUU GUUGAGUUCA 
UCCACUUCAU OUCUUUGUUU GGGCUUUAUU 
AUGOCAUGCC UUGCUAACUC UGUUUCUAGG 
CUUAGCUUUU UGACAUCCAC AGUUGCCAUC 
AUUUUGACAA UUAACUCUCU CAUUGUCUCA 
UCUUCUGGGA UCUCAUCACU AUUUCUCUUA 
CCUACACAAU CCUCAAUAGA CAACAGUAAA 
UUCUGGAUGU UCAUGUUUGC UUAAAAUCGU 
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Figure 7 Construction of expression vector pZIM 
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Figure 8 Construction of expression vector pZU-B 
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Figure 10 Construction of TSWV-Ns protein gene 
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Figure 1 1 Construction of plant transformation vector pTSVW-NAB 
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Figure 12 Construction ot plant transformation vector pTSWV-NmutBB 
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Figure 13 Construction of plant transformation vector pTSWV-NsAB 
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